Prediction of Protein Secondary Structure from PDB Structure Information Based on Sequence Segments Homology Searching

Authors

  • Shouji Tatsumoto
  • Kenji Satou
  • Akihiko Konagaya
Abstract

This paper describes a novel method for predicting protein secondary structure (e.g., helix, beta-sheet and coil). The method predicts the secondary structure of a query sequence using a segment-wise similarity search, which finds the most probable secondary structure based on similarities between the sequence segments of the query and two segment databases: the segment sequence DB and the segment structure DB. The key features of our system are: (i) the ability to visualize the evidence behind the prediction for a query sequence, and (ii) higher prediction accuracy for beta-sheets than existing methods. Since existing test sets (e.g., the RD126 set) are not applicable to our system for performance evaluation, we used an original blind test set (similar to CASP) that included 355 non-homologous protein chains. Our system achieves a secondary structure prediction accuracy of 76.9%, which is up to 20% greater than that of other prediction methods.
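The abstract does not give implementation details, so the following is only a minimal sketch of what a segment-wise similarity search could look like, under stated assumptions. The toy database `SEGMENT_DB`, the identity-based score, the window length, the `min_score` threshold, and the per-residue majority vote are all hypothetical choices made for illustration and are not taken from the paper. The `q3` helper computes per-residue accuracy over the three states, which is the usual meaning of a figure such as 76.9%.

```python
from collections import Counter

# Hypothetical toy segment database (NOT data from the paper): each sequence
# segment maps to its secondary structure (H = helix, E = strand, C = coil).
SEGMENT_DB = {
    "AEELL": "HHHHH",
    "EELLK": "HHHHH",
    "VKVTV": "EEEEE",
    "KVTVG": "EEEEC",
    "GSPGD": "CCCCC",
}


def identity(a, b):
    """Count positions at which two equal-length segments agree."""
    return sum(x == y for x, y in zip(a, b))


def predict(query, db=SEGMENT_DB, window=5, min_score=3):
    """Assign one H/E/C label per residue by voting over similar segments."""
    votes = [Counter() for _ in query]
    for start in range(len(query) - window + 1):
        segment = query[start:start + window]
        # Assumed scoring: pick the database segment with the most identities.
        best = max(db, key=lambda s: identity(segment, s))
        if identity(segment, best) < min_score:
            continue  # too dissimilar to count as evidence
        for offset, label in enumerate(db[best]):
            votes[start + offset][label] += 1
    # Residues without any supporting segment default to coil (an assumption).
    return "".join(v.most_common(1)[0][0] if v else "C" for v in votes)


def q3(predicted, observed):
    """Percentage of residues whose predicted state matches the observed one."""
    matches = sum(p == o for p, o in zip(predicted, observed))
    return 100.0 * matches / len(observed)
```

With this toy database, `predict("VKVTVG")` returns `"EEEEEC"` and `q3("HHHC", "HHCC")` returns `75.0`. Keeping the per-residue `votes` counters around is one simple way to expose the evidence behind each predicted label; a realistic system would use a substitution-matrix score and a much larger database derived from PDB.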


Similar resources

Protein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches

DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...

In Silico and in Vitro investigations on cry4a and cry11a toxins of Bacillus thuringiensis var Israelensis

In the present study we attempted to correlate the structure and function of the cry11a (72 kDa) and cry4a (135 kDa) proteins of Bacillus thuringiensis var israelensis. Homology modeling and secondary structure predictions were done to locate most probable regions for finding helices or strands in these proteins. The JPRED (JPRED consensus secondary structure prediction server) secondary struct...

Predicting the Three-Dimensional Structures of Proteins: Combined Alignment Approach

Protein structure prediction is a great challenge in molecular biophysics and bioinformatics. Most approaches to structure prediction use known structure information from the Protein Data Bank (PDB). In these approaches, it is most crucial to find a homologous protein (template) from the PDB to a query sequence and to align the query sequence to the template sequence. We propose a profile-profi...

Representative Protein Sequence and Structure Database

The database provides the information about the non-redundant protein dataset (1573 proteins) obtained from the Protein Data Bank. The information includes PDB ID, Length of the protein, Resolution, PDB Secondary structure, PDB secondary structure summary, PHD secondary structure prediction, PHD secondary structure prediction summary, sequence. We further revised the PDB Secondary structure sum...

TOPITS: Threading One-Dimensional Predictions Into Three-Dimensional Structures

Homology modelling, currently, is the only theoretical tool which can successfully predict protein 3D structure. As 3D structure is conserved in sequence families, homology modelling allows to predict 3D structure for 20% of SWISSPROT. 20% of the proteins in PDB are remote homologues to another PDB protein. Threading techniques attempt to predict such remote homologues based on sequence informa...



Publication date: 2004